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METHOD AND SYSTEM FOR DE- JITTERING 
TRANSMITTED MPEG-2 AND MPEG-4 VIDEO 

Cross-Reference to Related Applications 

This application claims priority under 35 U.S.C. § 119 based on U.S. Provisional 
Application Serial No. 60/167,339, filed November 24, 1999, the disclosure of which is 
incorporated by reference. 

Statement Regarding Federally Sponsored Research Or Development 

This invention was made with Government support under Contract No. DAAL- 

01-96-2-0002, awarded by the U.S. Army Research Laboratory, The Government has 

certain rights in this invention. 

Field of Invention 

This invention relates generally to the field of multimedia transmission over a 
network. More specifically, this invention relates to method of de-jittering MPEG-2 and 
MPEG-4 video data transmitted over a packet switched network. 

Background of the Invention 

The MPEG-2 and MPEG-4 standards are well-known in the art for coding and 

storing multimedia video and associated audio information. When MPEG multimedia 
data is transmitted over a network from a source device to a destination device, it is 

important that the transmitted data be spchronized at the destination device by matching 

the destination device's clock to the source device's clock. It is known in the art to use a 
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phase locked loop (PLL) at the destination device to synchronize the source device's 
clock with the destination device's clock. 

Generally, as is known in the art, MPEG-2 and MPEG-4 standards call for 
multimedia data to be coded and stored in discrete data packets. The format of each data 
packet provides for a "clock-stamp" reference value in which a time reference value from 
the source device's clock can be stored prior to transmission across the network. When a 

stream of data packets are transmitted over a network, only a selected sample of the data 
packets actually include a clock-stamp time reference stored in the reserved data bytes. 

The destination device compares the clock-stamp time references that it receives in the 
transmitted MPEG data with the instant time provided by the destination device's local 

clock. From this comparison, a phase error can be derived. A PLL uses the phase error 

to adjust the decoder clock. Methods of comparing clock-stamp time references with the 

destination device's clock to determine a phase error and enable a PLL to adjust the 
destination device's clock to match the source device's clock are known in the art. 

For purposes of synchronizing the device's respective clocks, MPEG semantics 
assume a constant delay network between the source device and the destination device. 
However, it is difficult, if not impossible, to maintain a constant network delay. Non- 
constant network delays, known as "jitter", can result in a degradation of the video 
playback. Jitter results in data packets arriving at the destination device in a non-uniform 
manner, which impedes effective clock synchronization by the PLL. Specifically, the 
PLL must perform additional filtering in order to correctly estimate the STC clock values. 

This, in turn, slows down the responsiveness of the PLL and affects the maximum phase 

error introduced by the PLL between the clock-stamped reference values encoded from 
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the source device's clock and the corresponding destination device's time clock 

references. To assure a stable recovery of the source device's clock values (also referred 

to as the system clock (STQ) by the PLL, de-jittering algorithms must be performed 

before the encoded clock values are passed to the PLL. 

Brief Summary of the Invention 

The present invention comprises an improved method and system for reducing 
jitter in MPEG data transmissions due to non-constant network delay times. Generally, 

the present invention calculates a statistical estimation of the average network system 
jitter. The estimated average network system jitter is then used to re-calculate a 

"corrected" reference value for subsequent clock-stamp reference values. Specifically, 
for each data packet that contains a clock-stamp reference value, the clock-stamp 

reference value is parsed out from the rest of the data packet. The average network jitter 
is estimated based on a prior pre-determined sample of data packets. An estimated jitter 
is then calculated for the reference data packet. The estimated reference jitter is then 

translated to clock tics and a "corrected" clock-stamp reference value is calculated. 
Finally, the original clock-stamp reference value of the subsequent reference data packet 
is replaced with the "corrected" clock reference value, which includes compensation for 
the statistical estimation of network jitter, before it is sent to a phase locked loop (PLL). 
Since the new clock reference values are "corrected" based upon the statistical estimation 

of the average network system jitter, the phase error of the PLL is minimized, resulting in 

a more stable system time clock (STC). Among other benefits, the present invention 
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improves the quality of the received video and enables the system to tolerate more 

network jitter without video degradation. 



Description of the Drawings 

5 Figure 1 shows a representative MPEG data packet comprising a header portion 

and a payload portion. 

Figure 2 is a flowchart illustrating the steps of the present invention. 

Figure 3 comprises timing diagrams that illustrate the relative times between the 
transmission and receipt of MPEG data packets. 

10 

Detailed Description of a Preferred Embodiment 

MPEG-2 and MPEG-4 video standards provide for multimedia data to be coded 
and transported in data packets. As shown in Figure 1, each MPEG-2 and MPEG-4 data 
packet comprises a header portion and a payload portion. As is known in the art, the 

15 header portion of the packet contains administrative information about the data packet, 

such as packet ED, transport priority, etc. The payload portion of the packet contains 
video and audio data. Depending on the format of the data packets (either MPEG-2 or 

MPEG-4), each header portion contains a Program Clock Reference (PCR) or Object 
Clock Reference (OCR), both of which correspond to the source device's clock at the 
20 time the reference data packet is transmitted. PCR or OCR data is included periodically 

in data packets transmitted from the source device to the destination device, and the data 
is used to synchronize the system clock reference (STC) at the destination device with the 
clock at the source device. 
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Figure 2 shows a flow-chart that illustrates the steps of the present invention, and 
Figure 3 shows a source device time line and a destination device time line that illustrates 
the relative timing of data packets transmitted from a source device to a destination 

device. The various time intervals shown in Figure 3 assist in illustrating the steps shown 
in Figure 2. 

Referring to Figures 2 and 3, it is assumed that the source device transmits a 
stream of MPEG data packets with a constant nominal period T ~ i.e., with a constant 

nominal time period between each data packet transmission. The particular value of 
period T depends upon the application, and the present invention can be used in 
connection with any period T. As shown in Figure 3, data packets are transmitted with 
period T from the source device at departure times (D t ). Arrival times A t correspond to 

theoretical arrival times at the destination device, assuming a constant delay network with 
delay time D re f. However, because the network has a non-constant delay time, the actual 
arrival times differ from the theoretical arrival times. The actual arrival times are 
designated in Figure 3 as A a . For each data packet that arrives at the destination device, 
the differences between the actual arrival time A a and the theoretical arrival time A t 

constitutes the jitter J for that particular data packet. Each data packet that arrives at the 

destination device is stored in a computer memory device that is of the type that is well- 
known in the art. 

It is assumed that a clock-stamp reference value carrying a snap shot of the value 
of the clock at the source device is periodically stored in the header portion of data 
packets and sent every T re f time, or every N packets. Again, the particular frequency with 
which reference clock values are inserted — the particular values of T re f and/or N — do not 
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affect the applicability of the present invention. The value of T re f could be an inter-PCR 
or inter-OCR period, as is known in the art, depending upon the specific transport 

mechanism used. Each data packet that contains a clock-stamped reference value is 

considered a reference data packet. 

In step 22 of Figure 2, the destination device's clock is initiated to the clock- 
stamp reference value of the first data packet received by the destination device that 
includes a clock-stamp reference value. The initiation of the destination device's clock in 

this manner is done by default as per the MPEG standards. In step 26, a destination 
device counter is also initiated to the reference clock value to be used in connection with 

de-jittering. The destination device counter is used to register the actual arrival times of 
the incoming MPEG data packets. 

Per step 28 of Figure 2, the arrival time of the first reference data packet carrying 

a clock-stamp reference value is registered and saved. Then, as shown in step 30, the 
actual arrival times (A a ) of all subsequent packets (N) received between two successive 
reference data packets are registered and saved. The arrival times are stored in computer 
memory devices that are well-known to those skilled in the art. The actual arrival times 
(A a ) of the N packets are referred to herein as A ai , where i=l to N. In step 34, the 

theoretical arrival times (A t ), assuming a constant delay network, of the N packets are 

calculated using the actual arrival time of the reference data packet (A ref i) as a reference 
point. Specifically, the theoretical arrival times (A^, where i=l to N) are calculated as 

follows; 

A ti - A aren + i * T, 
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where A are fi represents the actual arrival time of the most currently-received reference 
data packet. As shown in step 38 of Figure 2, a jitter value (J i? where i=l to N) is 

calculated for each received data packet by subtracting the actual arrival times (A a ) from 

the theoretical arrival times (A t ) according to the following formula; 

Ji = Ati — Aaj. 

After all of the jitter values (Ji) have been calculated for the current subset of N 
data packets, a sample mean jitter (|n) is calculated, as shown in step 42, according to the 

following formula: 

N 

JJL = (l/N)*Zji 

i=l 

The calculated sample mean jitter value Qi) can be positive, negative, or zero depending 

on the delay (D re f) experienced by the reference data packet and the number of data 
packets (N) in the sample subset. The sample mean jitter (jut) represents the average 
network system jitter over the current sample of N data packets. 

Based upon the calculated sample mean jitter value the jitter of the next 
reference data packet is estimated. Specifically, as shown in step 46, a "corrected" 
theoretical arrival time (A ctre f2) is calculated for the next reference data packet according 
to the following formula: 

A c tref2 = (Ami + (Ntl) *T)-|A 

According to the above formula, the corrected theoretical arrival time of the next 
reference data packet (A ctre f2) is determined by calculating the uncorrected theoretical 

arrival time (A aref] + (N+l) * T) and subtracting the estimated mean network jitter (ji), 
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The corrected theoretical arrival time of the next reference data packet (A ctr ef2) is 

used to calculate the jitter associated with that data packet (J re f2) . After the next 
reference data packet containing a clock-stamped reference value is received (step 48), 

the jitter of that reference data packet is calculated by subtracting the actual arrival time 
from the corrected theoretical arrival time according to the following formula, as shown 
in step 50: 

where A ctref 2 is the corrected theoretical arrival time of the next reference data packet and 
A are f2 is the actual arrival time of the next reference data packet. The corrected 

theoretical arrival times and the jitter values of the clock-stamp reference values are 
determined by an electronic controller that is of the type that is well-known in the art. 

The corrected theoretical arrival time of the newly-received reference data packet 
is then used as a reference point for the calculation of the sample mean jitter of the next N 

data packets. Specifically, the sample mean jitter of the next N data packets is calculated 
as described above, except that the corrected theoretical arrival reference time (A ctre f2) 
replaces the actual arrival time reference (A^n) described hereinabove. Since the jitter 

calculation of the next N packets is based on a clock-stamped reference time that 

incorporates compensation for an estimated average network delay, the value of |i for the 
following sets of N data packets should be close to zero and exhibit little variation under 
the same network operating conditions. 

In step 54, the jitter value (J re f2) is translated to an adjustment step (A) in terms of 

the number of STC tics, according to the following formula: 

A = J re f2 * STC resolution, 
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where J re f2 is measured in seconds, and STC resolution is in tics per second. Based on the 
A value, a corrected clock-stamp reference value is calculated. As shown in step 58, the 

corrected clock-stamp reference value, which includes compensation for the average 
network delay, replaces the actual clock-stamp reference value stored in the reference 

5 data packet before it is sent to the PLL. Replacing the received clock-stamped time 

reference with the calculated corrected clock-stamp time reference before it is sent to the 

PLL minimizes the phase error of the PLL and provides a more stable STC 

reconstruction. 

The above-described process is repeated, as shown in Figure 2, each time a new 

10 reference data packet having a clock-stamp reference time included therein is received by 
the destination device. In this way, the actual clock-stamp reference value of each 

reference packet is replaced with a corrected clock-stamp reference value that 
incorporates compensation for the network system jitter. As a result, the destination 

device's PLL is more effective in recovering the system clock STC, which improves the 
15 quality of the video playback at the destination device. 

While a preferred embodiment of the present invention has been described herein, 
it is apparent that the basic construction can be altered to provide other embodiments that 
utilize the processes and compositions of this invention. Therefore, it will be appreciated 
that the scope of this invention is to be defined by the claims appended hereto rather than 

20 by the specific embodiment that has been presented hereinbefore by way of example. 
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Claims 

L A method of compensating for non-constant delay times of a network 
transmitting MPEG-2 and MPEG-4 data packets, comprising the steps of: 

estimating a network system jitter associated with reference data packets carrying 
clock-stamped reference values; and 

adjusting said clock-stamped reference values based on said estimated network 
system jitter. 

2. The method of claim 1, wherein said step of estimating a network system 

jitter comprises calculating a mean jitter value associated with a sample of data packets. 

3. The method of claim 2, wherein said adjusting step comprises the substeps 
of calculating an estimated jitter value associated with a subsequent reference data packet 
based on said mean jitter value; and adjusting said clock-stamped reference value of said 
subsequent reference data packet based on said estimated jitter value associated with said 
subsequent reference data packet. 

4. The method of claim 3, wherein said step of calculating an estimated jitter 
value associated with a subsequent reference data packet comprises the substep of 
calculating a corrected theoretical arrival time of a subsequent reference data packet 
based upon said calculated mean jitter value. 
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5. The method of claim 4, wherein said calculation of a jitter value 

associated with a subsequent reference data packet is based upon said corrected 
theoretical arrival time and an actual arrival time of said subsequent reference data 
packet. 

6. The method of claim 5, wherein said adjusting step further comprises the 
substep of translating said jitter value associated with said subsequent reference data 
packet to a corresponding number of clock tics. 

7. The method of claim 1, wherein said adjusting step comprises the substeps 
of calculating an estimated jitter value associated with a subsequent reference data 
packet; and 

adjusting said clock-stamped reference value of said subsequent reference data 
packet based on said estimated jitter value associated with said subsequent reference data 
packet. 

8. The method of claim 7, wherein said step of calculating an estimated jitter 
value associated with a subsequent reference data packet comprises the substep of 
calculating a corrected theoretical arrival time of a subsequent reference data packet. 

9. The method of claim 8, wherein said calculation of a jitter value 

associated with a subsequent reference data packet is based upon said corrected 

ii 
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theoretical arrival time and an actual arrival time of said subsequent reference data 

packet. 

10. The method of claim 9, wherein said adjusting step further comprises the 

substep of translating said jitter value associated with said subsequent reference data 

packet to a corresponding number of clock tics. 

11. A method of compensating for non-constant delay times of a network 
transmitting MPEG-2 and MPEG-4 data packets, comprising the steps of: 

periodically receiving data packets with a nominal period; 

detecting a clock-stamp reference value in a first reference data packet; 

calculating a jitter value of each data packet received subsequent to said first 

reference data packet until a second reference data packet having a clock-stamp reference 

value is detected; 

determining a sample mean jitter from said jitter values; 

establishing a corrected theoretical arrival time for said second reference data 

packet; 

, estimating the jitter of said second reference data packet; and 
adjusting said clock-stamp reference value of said second reference data packet. 
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12. An MPEG-2 and MPEG-4 transmission network, comprising: 

a source device that transmits MPEG-2 or MPEG-4 data packets with a nominal 

period; 

a destination device that receives said data packets; 

an electronic communication channel having a non-constant delay period that is 

coupled between said source device and said destination device to receive said data 
packets from said source device and provide said data packets to said destination device; 

said destination device comprising an electronic controller that calculates a mean 
jitter value for a sample of said data packets, estimates a jitter value for a subsequent 
reference data packet outside of said sample, and adjusts a clock-stamp reference value of 

said subsequent reference data packet based on said estimated jitter value. 
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Abstract 

The present invention relates to an improved method of de-jittering MPEG-2 and 
MPEG-4 data that is transmitted over a network. First, a network system jitter associated 
with periodic reference data packets is estimated. Then, the estimated system jitter is 

used to adjust clock-stamped reference values in the data packets before they are 

provided to the PLL for clock synchronization. The invented de-jittering method 

improves the PLL's ability to synchronize the MPEG data and provides for a better 

quality playback. 
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